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Abstract.  The  degeneracy  of  an  //-vertex  graph  G  is  the  smallest  number  d  such 
that  every  subgraph  of  G  contains  a  vertex  of  degree  at  most  d.  We  show  that  there 
exists  a  nearly-optimal  fixed-parameter  tractable  algorithm  for  enumerating  all 
maximal  cliques,  parametrized  by  degeneracy.  To  achieve  this  result,  we  modify 
the  classic  Bron-Kerbosch  algorithm  and  show  that  it  runs  in  time  0(dn3dl 3).  We 
also  provide  matching  upper  and  lower  bounds  showing  that  the  largest  possible 
number  of  maximal  cliques  in  an  /7-vertex  graph  with  degeneracy  d  (when  d  is  a 
multiple  of  3  and  n  >  d  +  3)  is  (n  —  d) 3d^ .  Therefore,  our  algorithm  matches  the 
&(d(n  —  d) 3^' 3)  worst-case  output  size  of  the  problem  whenever  n  —  d  =  Q(n). 

Keywords:  sparse  graphs,  4-degenerate  graphs,  maximal  clique  listing  algo¬ 
rithms,  Bron-Kerbosch  algorithm,  fixed-parameter  tractability 


1  Introduction 


Cliques,  complete  subgraphs  of  a  graph,  are  of  great  importance  in  many  applications.  In 
social  networks  cliques  may  represent  closely  connected  clusters  of  actors  161 141281401 
and  may  be  used  as  features  in  exponential  random  graph  models  for  statistical  analysis 
of  social  networks  11 71 19120144 1491.  In  bioinformatics,  clique  finding  procedures  have 
been  used  to  detect  structural  motifs  from  protein  similarities  B26I35I36I,  to  predict 
unknown  protein  structures  EH,  and  to  determine  the  docking  regions  where  two 
biomolecules  may  connect  to  each  other  ll22l.  Clique  finding  problems  also  arise  in 
document  clustering  0,  in  the  recovery  of  depth  from  stereoscopic  image  data  |29l.  in 
computational  topology  l52l.  and  in  e-commerce,  in  the  discovery  of  patterns  of  items 
that  are  frequently  purchased  together  ED. 

Often,  it  is  important  to  find  not  just  one  large  clique,  but  all  maximal  cliques.  Many 
algorithms  are  now  known  for  this  problem  11I7I9I10I1 1I23I28I32I41I43I47I  and  for  the 
complementary  problem  of  finding  maximal  independent  sets  11613 1 13713914811 .  One  of 
the  most  successful  in  practice  is  the  Bron-Kerbosch  algorithm,  a  simple  backtracking 
procedure  that  recursively  solves  subproblems  specified  by  three  sets  of  vertices:  the 
vertices  that  are  required  to  be  included  in  a  partial  clique,  the  vertices  that  are  to  be 
excluded  from  the  clique,  and  some  remaining  vertices  whose  status  still  needs  to  be 


determined  17191321351471. 

All  maximal  cliques  can  be  listed  in  polynomial  time  per  clique  1371481  or  in  a 
total  time  proportional  to  the  maximum  possible  number  of  cliques  in  an  /7-vertex 
graph,  without  additional  polynomial  factors  E|47|.  In  particular,  a  variant  of  the 
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Bron-Kerbosch  algorithm  is  known  to  be  optimal  in  this  sense  @32  ■  Unfortunately  this 
maximum  possible  number  of  cliques  is  exponential  ll42l.  so  that  all  general-purpose 
algorithms  for  listing  maximal  cliques  necessarily  take  exponential  time. 

We  are  faced  with  a  dichotomy  between  theory,  which  states  categorically  that  clique 
finding  takes  exponential  time,  and  practice,  according  to  which  clique  finding  is  useful 
and  can  be  efficient  in  its  areas  of  application.  One  standard  way  of  resolving  dilemmas 
such  as  this  one  is  to  apply  parametrized  complexity  ED:  one  seeks  a  parameter  of 
instance  complexity  such  that  instances  with  small  parameter  values  can  be  solved 
quickly.  A  parametrized  problem  is  said  to  b &  fixed-parameter  tractable  if  instances  with 
size  n  and  parameter  value  p  can  be  solved  in  a  time  bound  of  the  form  f(p)n0-i  \  where 
/  may  grow  exponentially  or  worse  with  p  but  is  independent  of  n.  With  this  style  of 
analysis,  instances  with  a  small  parameter  value  are  used  to  model  problems  that  can 
be  solved  quickly,  while  instances  with  a  large  parameter  value  represent  a  theoretical 
worst  case  that,  one  hopes,  does  not  arise  in  practice. 

The  size  of  the  largest  clique  does  not  work  well  as  a  parameter:  the  maximum  clique 
problem,  parametrized  by  clique  size,  is  hard  for  W[l],  implying  that  it  is  unlikely  to 
have  a  fixed-parameter  tractable  algorithm  E2,  and  Turan  graphs  have  ( n/k)k 

maximal  cliques  of  size  k  forcing  any  algorithm  that  lists  them  all  to  take  time  larger  than 
any  fixed-parameter-tractable  bound.  However,  clique  size  is  not  the  only  parameter  one 
can  choose.  In  this  paper,  we  study  maximal  clique  finding  parametrized  by  degeneracy, 
a  frequently-used  measure  of  the  sparseness  of  a  graph  that  is  closely  related  to  other 
common  sparsity  measures  such  as  arboricity  and  thickness,  and  that  has  previously  been 
used  for  other  fixed-parameter  problems  Il2l8l25l34ll.  We  are  motivated  by  the  fact  that 
sparse  graphs  often  appear  in  practice.  For  instance,  the  World  Wide  Web  graph,  citation 
networks,  and  collaboration  graphs  have  low  arboricity  lf24l.  and  therefore  have  low 
degeneracy.  Empirical  evidence  also  suggests  that  the  /7-index,  a  measure  of  sparsity  that 
upper  bounds  degeneracy,  is  low  for  social  networks  ITT).  As  we  show  in  Appendix [Aj 
protein-protein  interaction  networks  have  low  degeneracy  as  well. Furthermore,  planar 
graphs  have  degeneracy  at  most  five  ||38l,  and  the  Barabasi-Albert  model  of  preferential 
attachment  0,  frequently  used  as  a  model  for  large  scale-free  social  networks,  produces 
graphs  with  bounded  degeneracy.  We  show  that: 

-  A  variant  of  the  Bron-Kerbosch  algorithm,  when  applied  to  77-vertex  graphs  with 
degeneracy  d,  lists  all  maximal  cliques  in  time  0(dn3d /3). 

-  Every  77-vertex  graph  with  degeneracy  d  (where  d  is  a  multiple  of  three  and  n>d  +  3) 
has  at  most  (77  —d) 3d  ^  maximal  cliques,  and  there  exists  an  77-vertex  graph  with 
degeneracy  d  that  has  exactly  (77  —d) 3d ^  maximal  cliques.  Therefore,  our  variant 
of  the  Bron-Kerbosch  algorithm  is  optimal  in  the  sense  that  its  time  is  within  a 
constant  of  the  parametrized  worst-case  output  size. 

Our  algorithms  are  fixed-parameter  tractable,  with  a  running  time  of  the  form  0(f(d)n) 
where  f(d)  =  d3d^ .  Algorithms  for  listing  all  maximal  cliques  in  graphs  of  constant  de¬ 
generacy  in  time  O(n)  were  already  known  II 101 1 II.  but  these  algorithms  had  not  been  an¬ 
alyzed  for  their  dependence  on  the  degeneracy  of  the  graph.  We  compare  the  parametrized 
running  time  bounds  of  the  known  alternative  algorithms  to  the  running  time  of  our 
variant  of  the  Bron-Kerbosch  algorithm,  and  we  show  that  the  Bron-Kerbosch  algorithm 
has  a  much  smaller  dependence  on  the  parameter  d.  Thus  we  give  theoretical  evidence 
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(a) 


(b) 


Fig.  1.  (a)  A  graph  with  degeneracy  3.  (b)  A  vertex  ordering  showing  that  the  degeneracy 
is  not  larger  than  3. 


for  the  good  performance  for  this  algorithm  that  had  previously  been  demonstrated 
empirically. 

2  Preliminaries 

We  work  with  an  undirected  graph  G  =  (V,E),  which  we  assume  is  stored  in  an  ad¬ 
jacency  list  data  structure.  We  let  n  and  in  be  the  number  of  vertices  and  edges  of  G, 
respectively.  For  a  vertex  v,  we  define  r(v)  to  be  the  set  {w  |  (v,w)  £  E},  which  we  call 
the  neighborhood  of  v,  and  similarly  for  a  subset  W  C  V  we  define  F(W)  to  be  the  set 
nw€lvr(vv),  which  is  the  common  neighborhood  of  all  vertices  in  W. 

2.1  Degeneracy 

Our  algorithm  is  parametrized  on  the  degeneracy  of  a  graph,  a  measure  of  its  sparsity. 

Definition  1  (degeneracy).  The  degeneracy  of  a  graph  G  is  the  smallest  value  d  such 
that  every  nonempty  subgraph  of  G  contains  a  vertex  of  degree  at  most  d  H38]l. 

Figure  [T(aj|  shows  an  example  of  a  graph  of  degeneracy  3.  Degeneracy  is  also  known 
as  the  k-corc  number  0,  width  ED,  and  linkage  f33l  of  a  graph  and  is  one  less  than 
the  coloring  number  ifTHIl.  In  a  graph  of  degeneracy  d,  the  maximum  clique  size  can  be 
at  most  d  +  1,  for  any  larger  clique  would  form  a  subgraph  in  which  all  vertices  have 
degree  higher  than  d. 

If  a  graph  has  degeneracy  d,  then  it  has  a  degeneracy  ordering,  an  ordering  such 
that  each  vertex  has  d  or  fewer  neighbors  that  come  later  in  the  ordering.  Figure  [T(bj| 
shows  a  possible  degeneracy  ordering  for  the  example.  Such  an  ordering  may  be  formed 
from  G  by  repeatedly  removing  a  vertex  of  degree  d  or  less:  by  the  assumption  that 
G  is  £/-degenerate,  at  least  one  such  vertex  exists  at  each  step.  Conversely,  if  G  has  an 
ordering  with  this  property,  then  it  is  ^/-degenerate,  because  for  any  subgraph  H  of  G,  the 
vertex  of  H  that  comes  first  in  the  ordering  has  d  or  fewer  neighbors  in  H.  Thus,  as  Lick 
and  White  ll38l  showed,  the  degeneracy  may  equivalently  be  defined  as  the  minimum 
d  for  which  a  degeneracy  ordering  exists.  A  third,  equivalent  definition  is  that  d  is  the 
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minimum  value  for  which  there  exists  an  orientation  of  G  as  a  directed  acyclic  graph 
in  which  all  vertices  have  out-degree  at  most  d  CD:  such  an  orientation  may  be  found 
by  orienting  each  edge  from  its  earlier  endpoint  to  its  later  endpoint  in  a  degeneracy 
ordering,  and  conversely  if  such  an  orientation  is  given  then  a  degeneracy  ordering  may 
be  found  as  a  topological  ordering  of  the  oriented  graph. 

Degeneracy  is  a  robust  measure  of  sparsity:  it  is  within  a  constant  factor  of  other 
popular  measures  of  sparsity  including  arboricity  and  thickness.  In  addition,  degeneracy, 
along  with  a  degeneracy  ordering,  can  be  computed  by  a  simple  greedy  strategy  of 
repeatedly  removing  a  vertex  with  smallest  degree  (and  its  incident  edges)  from  the 
graph  until  it  is  empty.  The  degeneracy  is  the  maximum  of  the  degrees  of  the  vertices  at 
the  time  they  are  removed  from  the  graph,  and  the  degeneracy  ordering  is  the  order  in 
which  vertices  are  removed  from  the  graph  |[30l.  The  easy  computation  of  degeneracy 
has  made  it  a  useful  tool  in  algorithm  design  and  analysis  CM. 

We  can  implement  this  algorithm  in  0(n+m )  time  by  maintaining  an  array  D,  where 
D[i]  stores  a  list  of  vertices  of  degree  i  in  the  graph  0.  To  remove  a  vertex  of  minimum 
degree  from  the  graph,  we  scan  from  the  beginning  of  the  array  until  we  reach  the  first 
nonempty  list,  remove  a  vertex  from  this  list,  and  then  update  its  neighbors’  degrees  and 
move  them  to  the  correct  lists.  Each  vertex  removal  step  takes  time  proportional  to  the 
degree  of  the  removed  vertex,  and  therefore  the  algorithm  takes  linear  time. 

By  counting  the  maximum  possible  number  of  edges  from  each  vertex  to  later 
neighbors,  we  get  the  following  bound  on  the  number  of  edges  of  a  ^/-degenerate  graph: 

Lemma  1  (Proposition  3  of  11381).  A  graph  G  =  (V,E)  with  degeneracy  d  has  at  most 
d(n  —  edges. 

2.2  The  Bron-Kerbosch  Algorithm 

The  Bron-Kerbosch  algorithm  Q  is  a  widely  used  algorithm  for  finding  all  maximal 
cliques  in  a  graph.  It  is  a  recursive  backtracking  algorithm  which  is  easy  to  understand, 
easy  to  code,  and  has  been  shown  to  work  well  in  practice. 

A  recursive  call  to  the  Bron-Kerbosch  algorithm  provides  three  disjoint  sets  of 
vertices  R,  P,  and  X  as  arguments,  where  R  is  a  (possibly  non-maxima!)  clique  and 
PUX  =T(R)  are  the  vertices  that  are  adjacent  to  every  vertex  in  R.  The  vertices  in  P 
will  be  considered  to  be  added  to  clique  R,  while  those  in  X  must  be  excluded  from  the 
clique;  thus,  within  the  recursive  call,  the  algorithm  lists  all  cliques  in  PUR  that  are 
maximal  within  the  subgraph  induced  by  PGR UX.  The  algorithm  chooses  a  candidate 
v  in  P  to  add  to  the  clique  R.  and  makes  a  recursive  call  in  which  v  has  been  moved  from 
R  to  R\  in  this  recursive  call,  it  restricts  X  to  the  neighbors  of  v,  since  non-neighbors 
cannot  affect  the  maximality  of  the  resulting  cliques.  When  the  recursive  call  returns, 
v  is  moved  to  X  to  eliminate  redundant  work  by  further  calls  to  the  algorithm.  When 
the  recursion  reaches  a  level  at  which  P  and  X  are  empty,  R  is  a  maximal  clique  and  is 
reported  (see  Fig.  [2]».  To  list  all  maximal  cliques  in  the  graph,  this  recursive  algorithm  is 
called  with  P  equal  to  the  set  of  all  vertices  in  the  graph  and  with  R  and  X  empty. 

Bron  and  Kerbosch  also  describe  a  heuristic  called  pivoting,  which  limits  the  number 
of  recursive  calls  made  by  their  algorithm.  The  key  observation  is  that  for  any  vertex 
u  in  PUX,  called  a  pivot,  any  maximal  clique  must  contain  one  of  m’s  non-neighbors 
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proc  BronKerboschfP,  R,  X) 

1:  if  PUX  =  0  then 
2:  report  R  as  a  maximal  clique 

3:  end  if 

4:  for  each  vertex  vePdo 
5:  BronKerbosch(/>nr(v),  RL)  {v},  X  nf(v)) 

6:  P<-P\{v} 

7:  X<-IU  {v} 

8:  end  for 

proc  BronKerboschPivotfP,  R,  X) 

1:  if  PUX  =  0  then 
2:  report  R  as  a  maximal  clique 

3:  end  if 

4:  choose  a  pivot  u  £  PVJX  {Tomita  et  al.  choose  u  to  maximize  |PnF(n)|} 
5:  for  each  vertex  v  G  P\T(u)  do 
6:  BronKerboschPivotfP  nr(v),  PU  {v},  X  nr(v)) 

7:  P^P\{v} 

8:  X^-A:u{v} 

9:  end  for 


Fig.  2.  The  Bron-Kerbosch  algorithm  without  and  with  pivoting 


(counting  u  itself  as  a  non-neighbor).  Therefore,  we  delay  the  vertices  in  Pnr(u)  from 
being  added  to  the  clique  until  future  recursive  calls,  with  the  benefit  that  we  make  fewer 
recursive  calls.  Tomita  et  al.  B7l  show  that  choosing  the  pivot  u  from  P  U  X  in  order 
to  maximize  PnF(«)  guarantees  that  the  Bron-Kerbosch  algorithm  has  worst-case 
running  time  0( 3"/3),  excluding  time  to  write  the  output,  which  is  worst-case  optimal. 

3  The  Algorithm 

In  this  section,  we  show  that  apart  from  the  pivoting  strategy,  the  order  in  which  the 
vertices  of  G  are  processed  by  the  Bron-Kerbosch  algorithm  is  also  important.  By 
choosing  an  ordering  carefully,  we  develop  a  variant  of  the  Bron-Kerbosch  algorithm 
that  correctly  lists  all  maximal  cliques  in  time  0{dn3d /3).  Essentially,  our  algorithm 
performs  the  outer  level  of  recursion  of  the  Bron-Kerbosch  algorithm  without  pivoting, 
using  a  degeneracy  ordering  to  order  the  sequence  of  recursive  calls  made  at  this  level, 
and  then  switches  at  inner  levels  of  recursion  to  the  pivoting  rule  of  Tomita  et  al.  ga¬ 
in  the  original  Bron-Kerbosch  algorithm,  in  each  recursive  call  the  vertices  in  P 
are  considered  for  expansion  one  by  one  (see  line  4  of  BronKerbosch  in  Figure[2]).  The 
order  in  which  the  vertices  are  treated  is  not  specified.  We  first  analyze  what  happens  if 
we  fix  an  order  iq,  V2, . . .  ,v„  on  the  vertices  of  V,  and  use  the  same  order  consistently 
to  loop  through  the  vertices  of  P  in  each  recursive  call  of  the  non-pivoting  version  of 
BronKerbosch. 
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Fig.  3.  Partitioning  the  common  neighbors  of  a  clique  R  into  the  set  P  of  later  vertices  and 
the  set  X  of  remaining  neighbors. 


Observation  1  When  processing  a  clique  R  in  the  ordered  variant  of  Bron—Kerbosch, 
the  common  neighbors  ofR  can  be  partitioned  into  the  set  P  of  vertices  that  come  after 
the  last  vertex  ofR,  and  the  setX  of  remaining  neighbors ,  as  shown  in  Figure  [i] 

Our  algorithm  computes  a  degeneracy  ordering  of  the  given  graph,  and  performs 
the  outermost  recursive  calls  in  the  ordered  variant  of  the  Bron-Kerbosch  algorithm 
(without  pivoting)  for  this  ordering.  The  sets  P  passed  to  each  of  these  recursive  calls 
will  have  at  most  d  elements  in  them,  leading  to  few  recursive  calls  within  each  of  these 
outer  calls.  Below  the  top  level  of  the  recursion  we  switch  from  the  ordered  non-pivoting 
version  of  the  Bron-Kerbosch  algorithm  to  the  pivoting  algorithm  (with  the  same  choice 
of  pivots  as  Tomita  et  al.  l47| )  to  further  control  the  number  of  recursive  calls. 


proc  BronKerboschDegeneracyfV',  E) 

1:  for  each  vertex  v;  in  a  degeneracy  ordering  vo,  vi,  V2,  ...  of  (V,E)  do 
2:  P<-r(vI)n{vi+1,...,v„_i} 

3:  X  <-r(vi)n{vo,...,Vi_i} 

4:  BronKerboschPivotfP,  {v,},  X) 

5:  end  for 


Fig.  4.  Our  algorithm. 


Lemma  2.  The  Bron-Kerbosch  algorithm  using  the  Tomita  et  al.  pivoting  strategy 
generates  all  and  only  maximal  cliques  containing  all  vertices  in  R,  some  vertices  in  P, 
and  no  vertices  in  X,  without  duplication. 

Proof.  See  Tomita  et  al.  l47l.  □ 

Theorem  1.  Algorithm  BronKerboschDegeneracy  generates  all  and  only  maximal 
cliques  without  duplication. 

Proof.  Let  C  be  a  maximal  clique,  and  v  its  earliest  vertex  in  the  degeneracy  order.  By 
Lemma  [2]  C  will  be  reported  (once)  when  processing  v.  When  processing  any  other 
vertex  of  C,  v  will  be  in  X,  so  C  will  not  be  reported.  □ 

To  make  pivot  selection  fast  we  pass  as  an  additional  argument  to  BronKerboschPivot 
a  subgraph  Hpx  of  G  that  has  PGX  as  its  vertices;  an  edge  (u,v)  of  G  is  kept  as  an  edge 
in  Hpx  whenever  at  least  one  of  u  or  v  belongs  to  P  and  both  of  them  belong  to  /J  U  X . 
The  pivot  chosen  according  to  the  pivot  rule  of  Tomita  et  al.  03)  is  then  just  the  vertex 
in  this  graph  with  the  most  neighbors  in  P. 
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Lemma  3.  Whenever  BronKerboschDegeneracy  calls  BronKerboschPivot  it  can  form 
Hpx  in  time  0(d(\P\  +  lA-!)). 

Proof.  The  vertex  set  of  Hpx  is  known  from  P  and  X.  Each  edge  is  among  the  d  outgoing 
edges  from  each  of  its  vertices.  Therefore,  we  can  achieve  the  stated  time  bound  by 
looping  over  each  of  the  d  outgoing  edges  from  each  vertex  in  PUX  and  testing  whether 
each  edge  meets  the  criterion  for  inclusion  in  llpx-  □ 

The  factor  of  d  in  the  time  bound  of  Lemma[3]makes  it  too  slow  for  the  recursive 
part  of  our  algorithm.  Instead  we  show  that  the  subgraph  to  be  passed  to  each  recursive 
call  can  be  computed  quickly  from  the  subgraph  given  to  its  parent  in  the  recursion  tree. 

Lemma  4.  In  a  recursive  call  to  BronKerboschPivot  that  is  passed  the  graph  Hpx  as  an 
auxiliary  argument,  the  sequence  of  graphs  H pnr(v),xc\r(v)  to  be  passed  to  lower-level 
recursive  calls  can  be  computed  in  total  time  <9(|P|3(|P|  +  |AT | ) ). 

Proof.  It  takes  0{\P\  +  |X|)  time  to  identify  the  subsets  PnF(v)  and  inT(v)  by 
examining  the  neighbors  of  v  in  Hpx  ■  Once  these  sets  are  identified,  HPnr(y),xnr(v)  may 
be  constructed  as  a  subgraph  of  Hpx  in  time  0(|P|(|P|  +  |AC  | ) )  by  testing  for  each  edge 
of  Hpx  whether  its  endpoints  belong  to  these  sets.  There  are  0(  \P\  )  graphs  to  construct, 
one  for  each  recursive  call,  hence  the  total  time  bound.  □ 

Lemma  5  (Theorem  3  of  Ii47ll).  Let  T  be  a  function  which  satisfies  the  following  recur¬ 
rence  relation: 

T{p)<[mWLk{kT^P~k^}+dpl  lfp>0 
~  \e  ifP  =  0 

where  p  and  k  are  integers,  such  that  p  >  k,  and  d,e  are  constants  greater  than  zero. 
Then,  T(p)  <  ma x^kT  (p  —  k)}  +  d  p2  =  0( 3p/3). 

Lemma  6.  Let  v  be  a  vertex,  Pv,  be  v  ’s  later  neighbors,  and  Xv  be  v  ’s  earlier  neighbors. 
Then  BronKerboschPivot(Pv,  {v},  X,,)  executes  in  time  0((d  +  |Arv|)3l'p*'l/3),  excluding 
the  time  to  report  the  discovered  maximal  cliques. 

Proof.  Define  D(p,x)  to  be  the  running  time  of  B ron Kerbosch Pivot! I\  ,  {v},  Xv).  where 
p  =  |P,.|,  and  x  =  |^TV | .  We  show  that  D(p,x)  =  0{{d  +x)3p^).  By  the  description  of 
BronKerboschPivot,  D  satisfies  the  following  recurrence  relation: 


D{p,x)  < 


I  max/i{kD(p  —  k,x)j  +  cip2(p  +  x)  if  p  >  0 
\c2  if  p  =  0 


where  c i  and  c2  are  constants  greater  than  0. 

Since  our  graph  has  degeneracy  d,  the  inequality  p  +  x  <  d  +x  always  holds.  Thus, 

D(p,x)  <  max{kD(p  —  k,x)}+c\p2(p  +  x) 
k 


<  (d  +  x)  (  max 


+  c\p‘ 


kD(p  —  k,x) 
d+x 


<  (d+x)  ymax{kT(p  —  k)}+cip" 

=  0((d  +jc)3p/3)  by  letting  d  =  ci ,  e  =  c2  in  Lemma[5] 
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□ 

Theorem  2.  Given  an  n-vertex  graph  G  with  degeneracy  d,  our  algorithm  reports  all 
maximal  cliques  of  G  in  time  0{dn3L 

Proof.  For  each  initial  call  to  BronKerboschPivot  for  each  vertex  v,  we  first  spend  time 
0{d{\Pv\  +  |XV|))  to  set  up  subgraph  Hpv.xv-  Over  the  entire  algorithm  we  spend  time 

£0(d(|Pv|  +  |X»D)  =  0(dm)  =  0(d2n) 

V 

setting  up  these  subgraphs.  The  time  spent  performing  the  recursive  calls  is 

Y,0((d+\Xv\)3\p^3)  =  0((dn  +  m)3d^)  =  0{dn3d/3), 


and  the  time  to  report  all  cliques  is  0{dfi),  where  ll  is  the  number  of  maximal  cliques. 
We  show  in  the  next  section  that  /i  =  {n  —  d)3d/3  in  the  worst  case,  and  therefore  we 
take  time  0(d(n  —  d)3d^)  reporting  cliques  in  the  worst  case.  Therefore,  the  algorithm 
executes  in  time  0(dn3d^).  □ 

This  running  time  is  nearly  worst-case  optimal,  since  there  may  be  0{{n  —  d)3d/3) 
maximal  cliques  in  the  worst  case. 


4  Worst-case  Bounds  on  the  Number  of  Maximal  Cliques 


Theorem  3.  Let  d  be  a  multiple  of  3  and  n  >d  +  3.  Then  the  largest  possible  number 
of  maximal  cliques  in  an  n-vertex  graph  with  degeneracy  d  is  ( n  —  d)  3d/3. 

Proof  We  first  show  that  there  cannot  be  more  than  {n  —  d)3d'3  maximal  cliques.  We 
then  show  that  there  exists  a  graph  that  has  (n  —  d) 3d/3  maximal  cliques. 


An  Upper  Bound.  Consider  a  degeneracy  ordering  of  the  vertices,  in  which  each  vertex 
has  at  most  d  neighbors  that  come  later  in  the  ordering.  For  each  vertex  v  that  is  placed 
among  the  first  n  —  d  —  3  vertices  of  the  degeneracy  ordering,  we  count  the  number  of 
maximal  cliques  such  that  v  is  the  clique  vertex  that  comes  first  in  the  ordering. 

Since  vertex  v  has  at  most  d  later  neighbors,  the  Moon-Moser  bound  PEI  applied 
to  the  subgraph  induced  by  these  later  neighbors  shows  that  they  can  form  at  most  3d /3 
maximal  cliques  with  each  other.  Since  these  vertices  are  all  neighbors  of  v,  v  participates 
in  at  most  3d /3  maximal  cliques  with  its  later  neighbors.  Thus,  the  first  n  —  d  —  3  vertices 
contribute  to  at  most  (, n  —  d  —  3)3d/3  maximal  cliques  total. 

By  the  Moon-Moser  bound,  the  remaining  d  +  3  vertices  in  the  ordering  can  form  at 
most  3<'d+ 3)/3  maximal  cliques.  Therefore,  a  graph  with  degeneracy  d  can  have  no  more 
than  {n  —  d  —  3)3rf/3  +  3^+3^/3  =  {n  —  d) 3d/3  maximal  cliques. 
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Fig.  5.  The  lower  bound  construction  for  d  =  6,  consisting  of  a  Moon-Moser  graph  of  size 
d  on  the  right  (blue  vertices)  and  an  independent  set  of  n  —  d  remaining  vertices  that  are 
each  connected  to  all  of  the  last  d  vertices. 


A  Lower  Bound.  By  a  simple  counting  argument,  we  can  see  that  the  graph  3333 
contains  ( n  —  d) 3d/2  maximal  cliques:  Each  maximal  clique  must  contain  exactly  one 
vertex  from  each  disjoint  independent  set  of  vertices,  and  there  are  (n  —  d)3d/2  ways  of 
forming  a  maximal  clique  by  choosing  one  vertex  from  each  independent  set.  Figure  [5] 
shows  this  construction  for  d  =  6.  We  can  also  see  that  this  graph  is  (/-degenerate  since 
in  any  ordering  of  the  vertices,  the  first  vertex  must  have  d  or  more  later  neighbors,  and 
in  any  ordering  where  the  n  —  d  disjoint  vertices  come  first,  these  first  n  —  d  vertices  have 
exactly  d  later  neighbors,  and  the  last  d  vertices  have  fewer  later  neighbors.  □ 

Relatedly,  a  bound  of  (n  —  d  +  l)2d  on  the  number  of  cliques  (without  assumption 
of  maximality)  in  n-vertex  (/-degenerate  graphs  was  already  known  (50). 

5  Comparison  with  Other  Algorithms 

Chiba  and  Nishizeki  GO)  describe  two  algorithms  for  finding  cliques  in  sparse  graphs. 
The  first  of  these  two  algorithms  reports  all  maximal  cliques  using  0(am)  time  per 
clique,  where  a  is  the  arboricity  of  the  graph,  and  m  is  the  number  of  edges  in  G.  The 
arboricity  is  the  minimum  number  of  edge-disjoint  spanning  forests  into  which  the  graph 
can  be  decomposed  ]27l.  The  degeneracy  of  a  graph  is  closely  related  to  arboricity: 
a  <  d  <  2a  —  1.  In  terms  of  degeneracy,  Chiba  and  Nishizeki’s  algorithm  uses  0(d2n) 
time  per  clique.  Combining  this  with  the  bound  on  the  number  of  cliques  derived  in 
Sectionjdjresults  in  a  worst-case  time  bound  of  0(d2n(n  —  d) 3d/2).  For  constant  d ,  this 
is  a  quadratic  time  bound,  in  contrast  to  the  linear  time  of  our  algorithm. 

Another  algorithm  of  Chiba  and  Nishizeki  ifTOl  lists  cliques  of  order  /  in  time 
0{lal~2m).  It  can  be  adapted  to  enumerate  all  maximal  cliques  in  a  graph  with  degen¬ 
eracy  d  by  first  enumerating  all  cliques  of  order  d  +  1,  d,  ...  down  to  1,  and  remov¬ 
ing  cliques  that  are  not  maximal.  Applying  their  algorithm  directly  to  a  (/-degenerate 
graph  takes  time  0(ldl~ln).  Therefore,  the  running  time  to  find  all  maximal  cliques  is 
Hi<i<d+\  0{indl  ')  =  0(ndd+l).  Like  our  algorithm,  this  is  linear  when  d  is  constant, 
but  with  a  much  worse  dependence  on  the  parameter  d. 

Chrobak  and  Eppstein  ED  list  triangles  and  4-cliques  in  graphs  of  bounded  de¬ 
generacy  by  testing  all  sets  of  two  or  three  later  neighbors  of  each  vertex  according  to 
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a  degeneracy  ordering.  The  same  idea  extends  in  an  obvious  way  to  finding  maximal 
cliques  of  size  greater  than  four,  by  testing  all  subsets  of  later  neighbors  of  each  vertex. 
For  each  vertex  v,  there  are  at  most  2d  subsets  to  test;  each  subset  may  be  tested  for 
being  a  clique  in  time  0{d2),  by  checking  whether  each  of  its  vertices  has  all  the  later 
vertices  in  the  subset  among  its  later  neighbors,  giving  a  total  time  of  0{nd2 2d)  to  list 
all  the  cliques  in  the  graph.  However,  although  this  singly-exponential  time  bound  is 
considerably  faster  than  Chiba  and  Nishizeki,  and  is  close  to  known  bounds  on  the 
number  of  (possibly  non-maximal)  cliques  in  (/-degenerate  graphs  |50|,  it  is  slower  than 
our  algorithm  by  a  factor  that  is  exponential  in  d.  Our  new  algorithm  uses  this  same  idea 
of  searching  among  the  later  neighbors  in  a  degeneracy  order  but  achieves  much  greater 
efficiency  by  combining  it  with  the  Bron-Kerbosch  algorithm. 

Makino  and  Uno  ED  list  all  maximal  cliques  in  graphs  with  maximum  degree  A 
in  time  0(4 4 )  per  clique.  For  graphs  with  degeneracy  d,  A  can  be  any  value  between 
d  and  n—  1,  making  a  meaningful  comparison  with  our  algorithm  difficult.  Therefore, 
for  graphs  with  constant  degeneracy,  their  time  bound  is  a  factor  A 4  slower  than  our 
algorithm  in  the  worst  case,  which  may  be  a  constant  factor,  or  much  worse. 

6  Conclusion 

We  have  presented  theoretical  evidence  for  the  fast  performance  of  the  Bron-Kerbosch 
algorithm  for  finding  cliques  in  graphs,  as  has  been  observed  in  practice.  We  observe 
that  the  problem  is  fixed-parameter  tractable  in  terms  of  the  degeneracy  of  the  graph, 
a  parameter  that  is  expected  to  be  low  in  many  real-world  applications,  and  that  a 
slight  modification  of  the  Bron-Kerbosch  algorithm  performs  optimally  in  terms  of  the 
degeneracy. 

We  explicitly  prescribe  the  order  in  which  the  Bron-Kerbosch  algorithm  processes 
the  vertices  of  the  graph,  something  that  has  not  been  considered  before.  Without  this 
particular  order,  we  do  not  have  a  bound  on  the  running  time.  It  would  be  interesting  to 
determine  whether  a  random  order  gives  similar  results,  as  this  would  further  explain 
the  observed  performance  of  implementations  of  Bron-Kerbosch  that  do  not  use  the 
degeneracy  order. 

It  would  also  be  of  interest  to  determine  the  largest  possible  number  of  maximal 
cliques  among  all  77-vertex  graphs  with  degeneracy  d  (or,  slightly  differently,  at  most  d), 
under  less  restrictive  assumptions  than  those  made  in  Theorem[3] 
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A  Appendix 

A.l  The  Degeneracy  of  Protein-Protein  Interaction  Networks 

The  Biological  General  Repository  for  Interaction  Datasets  (BioGRID)  ]46l  Qis  a 
curated  database  containing  the  interactions  between  proteins  that  have  been  published 
in  the  literature,  and  those  that  have  been  discovered  through  high-throughput  screening 
methods.  Using  this  interaction  data,  we  can  form  a  graph  by  creating  a  vertex  for 
each  protein  and  an  edge  between  two  proteins  that  interact.  Such  a  graph  is  called  a 
protein-protein  interaction  (PPI)  network. 

We  computed  the  degeneracy  of  seven  PPI  networks  in  version  3.0.65  of  the  BioGRID 
database.  We  chose  to  omit  other  networks  in  the  BioGRID  database  because  they  contain 
only  a  handful  of  known  proteins  and  interactions.  The  results  are  summarized  in  Table [T] 
Observe  that,  for  each  PPI  network,  the  degeneracy  is  significantly  lower  than  both  the 
number  of  vertices  in  the  graph  and  the  maximum  degree  of  the  graph.  We  therefore 
conclude  that  the  degeneracy  of  these  PPI  networks  is  low  and  that  fixed-parameter 
algorithms  parametrized  by  degeneracy  can  be  expected  to  perform  well  on  these  graphs. 


Table  1.  Graph  statistics  for  seven  PPI  networks  from  version  3.0.65  of  the  BioGRID  database. 
For  each  network  we  list  the  number  of  vertices  (n),  the  number  of  edges  (in),  the  maximum  degree 
(A),  and  the  degeneracy  (d). 


Organism  (PPI  network) 

n 

m 

A 

d 

Mus  musculus 

1455 

1636 

111 

6 

Caenorhabditis  elegans 

3518 

6531 

523 

10 

Arabisopsis  thaliana 

1745 

3098 

71 

12 

Drosophila  melanogaster 

7282 

24894 

176 

12 

Homo  Sapiens 

9527 

31182 

308 

12 

Schizosaccharomyces  pombe 

2031 

12637 

439 

34 

Saccharomyces  cerevisae 

6008 

156945 

2557 

64 

1  http://thebiogrid.org/ 


